Pengembangan Engine Integrasi Tabel HTML pada Halaman Web
نویسندگان
چکیده
منابع مشابه
TabEL: Entity Linking in Web Tables
Web tables form a valuable source of relational data. The Web contains an estimated 154 million HTML tables of relational data, with Wikipedia alone containing 1.6 million high-quality tables. Extracting the semantics of Web tables to produce machine-understandable knowledge has become an active area of research. A key step in extracting the semantics of Web content is entity linking (EL): the ...
متن کاملCOMPASS: A Concept-based Web Search Engine for HTML, XML, and Deep Web Data
Today’s web search engines are still following the paradigm of keyword-based search. Although this is the best choice for large scale search engines in terms of throughput and scalability, it inherently limits the ability to accomplish more meaningful query tasks. XML query engines (e.g., based on XQuery or XPath), on the other hand, have powerful query capabilities; but at the same time their ...
متن کاملAn extensible rendering engine for XML and HTML
XML has been proposed in order to bring to the web a markup language free of the shortcomings of HTML, in particular the inextensibility of the set of valid elements (tags). Stylesheet languages have been proposed for XML, in order to provide precise and sophisticated typographical control over the appearance of text-based data. We have developed a rendering engine for HTML and XML documents, p...
متن کاملA Query Engine for Retrieving Information from Chinese HTML Documents
The amount of online information in Chinese and the number of Chinese Internet users have been increasing tremendously during the past decade. Since Chinese language is significantly different from English, techniques that have been developed for retrieving information from English Web documents cannot be directly applied to retrieve information from Chinese Web documents. In order to provide h...
متن کاملBeyond HTML: Web-Based Information Systems
In this paper we briefly review current status of Web-Based Information Systems (WBIS) and present a number of different WBIS technologies and how they are being integrated during the ARHON project (Archiving, Annotation and Retrieval of Historical Documents). The project amounts to transforming a vast collection of OCR-resisting historical manuscripts belonging to the Vikelaia Municipal Librar...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Jurnal Nasional Teknik Elektro dan Teknologi Informasi (JNTETI)
سال: 2016
ISSN: 2301-4156,2301-4156
DOI: 10.22146/jnteti.v5i3.254